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1 SEQUENCE LISTING 

2 

3 

4 (1) GENERAL INFORMATION: 
5 

6 (i) APPLICANT: Darrell Anderson, Nabil Hanna, John Leonard, 

7 Roland Nevnnan and Mitchell Re££ 
8 

9 (ii) TITLE OF INVENTION: THERAPEUTIC APPLICATION OF CHIMERIC 

10 ANTIBODY TO HUMAN B LYMPHOCYTE 
11 

12 (iii) NUMBER OF SEQUENCES: 8 
13 

14 (iv} CORRESPONDING ADDRESS: 

15 (A) ADDRESSEE: IDEC Pharmaceuticals Corporation 

16 (B) STREET: 11099 N. Torrey Pines Road, #160 

17 (C) CITY: La Jolla 

18 (D) STATE: California 

19 (E) COUNTRY: USA 

20 (F) ZIP: 92037 
21 

22 (v) COMPUTER READABLE FORM: 

23 (A) MEDIUM TYPE: Diskette, 3.5 inch, 1.44 Mb 

24 (B) COMPUTER: Macintosh 

25 (C) OPERATING SYSTEM: MS. DOS 

26 (D) SOFTWARE: Microsoft Word 5.0 
27 

28 (vi) CURRENT APPLICATION DATA: 

29 (A) APPLICATION NUMBER: US/07/978, 891B 

30 (B) FILING DATE: 13 NOV 1992 

31 (C) CLASSIFICATION: 424 
32 

33 (viii) ATTORNEY/AGENT INFORMATION: 

34 (A) NAME: Burgoon, Richard P. Jr. 

35 (B) REGISTRATION NUMBER: 34,787 

36 (C) REFERENCE /DOCKET NUMBER: 
37 

38 (ix) TELECOMMUNICATION INFORMATION: 

39 (A) TELEPHONE: (619) 458-0600 

40 (B) TELEFAX: (619) 546-9274 
41 

42 (2) INFORMATION FOR SEQ ID NO: 1: 

43 (i) SEQXTENCE CHARACTERISTICS: 

44 (A) LENGTH: 8540 bases 

45 (B) TYPE: nucleic acid 

46 (C) STRANDEDNESS : single 

47 (D) TOPOLOGY: circular 
48 

49 (ii) MOLECULE TYPE: DNA (genomic) 
50 

51 (iii) HYPOTHETICAL: no 
52 
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53 (iv) ANTI- SENSE: no 

54 

55 (ix) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

56 

57 GACGTCGCGG CCGCTCTAGG CCTCCAAAAA AGCCTCCTCA CTACTTCTGG AATAGCTCAG 60 
58 

59 AGGCCGAGGC GGCCTCGGCC TCTGCATAAA TAAAAAAAAT TAGTCAGCCA TGCATGGGGC 120 
60 

61 GGA6AATGGG CGGAACTGGG CGGAGTTAGG GGCGGGATGG GCGGAGTTAG GGGCGGGACT 180 
62 

63 ATGGTTGCTG ACTAATTGAG ATGCATGCTT TGCATACTTC TGCCTGCTGG GGAGCCTGGG 240 
64 

65 GACTTTCCAC ACCTGGTTGC TGACTAATTG AGATGCATGC TTTGCATACT TCTGCCTGCT 300 
66 

67 GGGGAGCCTG GGGACTTTCC ACACCCTAAC TGACACACAT TCCACAGAAT TAATTCCCCT 360 
68 

69 AGTTATTAAT AGTAATCAAT TACGGGGTCA TTAGTTCATA GCCCATATAT GGAGTTCCGC 420 
70 

71 GTTACATAAC TTACGGTAAA TGGCCCGCCT GGCTGACCGC CCAACGACCC CCGCCCATTG 480 
72 

73 ACGTCAATAA TGACGTATGT TCCCATAGTA ACGCCAATAG GGACTTTCCA TTGACGTCAA 540 
74 

75 TGGGTGGACT ATTTAC6GTA AACTGCCCAC TTGGCAGTAC ATCAAGTGTA TCATATGCCA 600 
76 

77 AGTACGCCCC CTATTGACGT CAATGACGGT AAATGGCCCG CCTGGCATTA TGCCCAGTAC 660 
78 

79 ATGACCTTAT GGGACTTTCC TACTTGGCAG TACATCTACG TATTAGTCAT CGCTATTACC 720 
80 

81 ATGGTGATGC GGTTTTGGCA GTACATCAAT GGGCGTGGAT AGCGGTTTGA CTCACGGGGA 780 
82 

83 TTTCCAAGTC TCCACCCCAT TGACGTCAAT GGGAGTTTGT TTTGGCACCA AAATCAACGG 840 
84 

85 GACTTTCCAA AATGTCGTAA CAACTCCGCC CCATTGACGC AAATGGGCGG TAGGCGTGTA 900 
86 

87 CGGTGGGAGG TCTATATAAG CAGAGCTGGG TACGTGAACC GTCAGATCGC CTGGAGACGC 960 
88 

89 CATCACAGAT CTCTCACCAT GAGGGTCCCC GCTCAGCTCC TGGGGCTCCT GCTGCTCTGG 1020 
90 

91 CTCCCAGGTG CACGATGTGA TGGTACCAAG GTGGAAATCA AACGTACGGT GGCTGCACCA 1080 
92 

93 TCTGTCTTCA TCTTCCCGCC ATCTGATGAG CAGTTGAAAT CTGGAACTGC CTCTGTTGTG 1140 
94 

95 TGCCTGCTGA ATAACTTCTA TCCCAGAGAG GCCAAAGTAC AGTGGAAGGT GGATAACGCC 1200 
96 

97 CTCCAATCGG GTAACTCCCA GGAGAGTGTC ACAGAGCAGG ACAGCAAGGA CAGCACCTAC 1260 
98 

99 AGCCTCAGCA GCACCCTGAC GCTGAGCAAA GCAGACTACG AGAAACACAA AGTCTACGCC 1320 
100 

101 TGCGAAGTCA CCCATCAGGG CCTGAGCTCG CCCGTCACAA AGAGCTTCAA CAGGGGAGAG 1380 
102 

103 TGTTGAATTC AGATCCGTTA ACGGTTACCA ACTACCTAGA CTGGATTCGT GACAACATGC 1440 
104 
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G6CCGTGATA 
GTTTGCCCCT 
TAATAAAATG 
G66GT666GC 
6C66TGGGCT 
ACGTCAATGA 
TTCCTACTTG 
GGCAGTACAT 
CCATTGACGT 
GTAACAACTC 
TAAGCAGA6C 
AT6GGTT6GA 
AAGGGCCCAT 
GCCCTGGGCT 
GGCGCCCT6A 
TCCCTCAGCA 
AACGTGAATC 
GACAAAACTC 
TTCCTCTTCC 
TGCGTGGTGG 
GGCGTGGAGG 
CGTGTGGTCA 
TGCAAGGTCT 
G6GCAGCCCC 
AACCAGGTCA 
TGGGAGAGCA 



TCTACGTATG 
CCCCCGTGCC 
AGGAAATTGC 
A6GACAGCAA 
CTATGGAACC 
CGGTAAATGG 
GCAGTACATC 
CAATG66CGT 
CAAT6GGAGT 
CGCCCCATTG 
TGGGTACGTC 
GCCTCATCTT 
CGGTCTTCCC 
GCCTGGTCAA 
CCAGCGGCGT 
GCGTGGTGAC 
ACAAGCCCAG 
ACACATGCCC 
CCCCAAAACC 
TGGACGTGAG 
TGCATAATGC 
GCGTCCTCAC 
CCAACAAAGC 
GA6AACCACA 
GCCTGACCTG 
ATGGGCAGCC 



ATCAGCCTCG 
TTCCTTGACC 
ATCGCATTGT 
G6G66AGGAT 
AGCTGGGGCT 
CCCGCCTGGC 
TACGTATTAG 
G6ATAGCGGT 
TTGTTTTGGC 
ACGCAAATGG 
CTCACATTCA 
GCTCTTCCTT 
CCTGGCACCC 
GGACTACTTC 
GCACACCTTC 
CGTGCCCTCC 
CAACACCAAG 
ACCGTGCCCA 
CAAGGACACC 
CCACGAAGAC 
CAAGACAAAG 
CGTCCTGCAC 
CCTCCCAGCC 
GGTGTACACC 
CCT6GTCAAA 
GGAGAACAAC 



ACTGTGCCTT 
CTGGAAGGTG 
CTGAGTAGGT 
TGG6AA6ACA 
C6ACAGCTAT 
ATTATGCCCA 
TCATCGCTAT 
TTGACTCACG 
ACCAAAATCA 
GCGGTAGGCG 
6TGATCAGCA 
GTCGCTGTTG 
TCCTCCAAGA 
CCCGAACCGG 
CCGGCTGTCC 
AGCAGCTTGG 
GT6GACAAGA 
GCACCTGAAC 
CTCATGATCT 
CCTGAGGTCA 
CCGCGGGAGG 
CAGGACTGGC 
CCCATCGAGA 
CTGCCCCCAT 
GGCTTCTATC 
TACAAGACCA 



CTAGTTGCCA 
CCACTCCCAC 
GTCATTCTAT 
ATAGCA66CA 
6CCAAGTACG 
6TACATGACC 
TACCATGGTG 
GGGATTTCCA 
ACGGGACTTT 
TGTACGGTGG 
CTGAACACA6 
CTACGCGTGT 
GCACCTCTGG 
TGACGGTGTC 
TACAGTCCTC 
GCACCCAGAC 
AAGCAGAGCC 
TCCTGGG6GG 
CCCGGACCCC 
AGTTCAACTG 
AGCAGTACAA 
TGAATG6CAA 
AAACCATCTC 
CCC66GATGA 
CCAGCGACAT 
CGCCTCCCGT 



GCCATCTGTT 1500 
TGTCCTTTCC 1560 
TCTGGGGGGT 1620 
T6CTGGGGAT 1680 
CCCCCTATTG 1740 
TTATGGGACT 1800 
ATGCGGTTTT 1860 
AGTCTCCACC 1920 
CCAAAATGTC 1980 
GAGGTCTATA 2040 
ACCCGTCGAC 2100 
CGCTAGCACC 2160 
GGGCACAGCG 2220 
GTGGAACTCA 2280 
AGGACTCTAC 2340 
CTACATCTGC 2400 
CAAATCTTGT 2460 
ACCGTCAGTC 2520 
TGAGGTCACA 2580 
GTACGTGGAC 2640 
CAGCACGTAC 2700 
GGAGTACAAG 2760 
CAAAGCCAAA 2820 
GCTGACCAAG 2880 
C6CCGTGGAG 2940 
GCTGGACTCC 3000 



Page : 4 



Raw Sequence Listing 
Patent J^lication US/07/978, 891B 



05/27/93 
16:25:02 
S4547 .raw 



157 
158 
159 
160 
161 
162 
163 
164 
165 
166 
167 
168 
169 
170 
171 
172 
173 
174 
175 
176 
177 
178 
179 
180 
181 
182 
183 
184 
185 
186 
187 
188 
189 
190 
191 
192 
193 
194 
195 
196 
197 
198 
199 
200 
201 
202 
203 
204 
205 
206 
207 
208 



GACGGCTCCT 
AACGTCTTCT 
CTCTCCCTGT 
TCGT6ACAAC 
GCCAGCCATC 
CCACTGTCCT 
CTATTCTGGG 
GGCATGCTGG 
CCC6ATCCCC 
AATTTTAACA 
AGACAGTGTT 
GACTCCTAAG 
GCCTGATTCC 
GCAGGAGCCA 
CTGACATAGT 
CAAACTTGAC 
GTTCGACCAT 
GACCTACCCT 
TCTTCAGTGG 
CCTGAGAAGA 
GAACCACCAC 
GAACAACCGG 
TACCAGGAAG 
GAATTTGAAA 
GAATACCCAG 
GTCTACGAGA 



TCTTCCTCTA 
CATGCTCCGT 
CTCCGGGTAA 
ATGCGGCCGT 
TGTTGTTTGC 
TTCCTAATAA 
GGGTGGGGTG 
GGATGCG6TG 
AGCTTTGCTT 
CCAATTCAGT 
CTCTGCACAG 
CCAGTGAGTG 
GTAGAGCCAC 
GGGCAGAGCA 
TGTGTTGGGA 
GGCAATCCTA 
TGAACTGCAT 
GGCCTCCGCT 
AAGGTAAACA 
ATCGACCTTT 
GAGGAGCTCA 
AATTGGCAAG 
CCATGAATCA 
GTGACACGTT 
GCGTCCTCTC 
A6AAAGACTA 



CAGCAAGCTC 
GATGCATGAG 
ATGAGGATCC 
GATATCTACG 
CCCTCCCCCG 
AATGAGGAAA 
GGGCAGGACA 
GGCTCTATGG 
CTCAATTTCT 
A6TTGATT6A 
ATAAGGACAA 
GCACAGCATT 
ACCTTGGTAA 
TATAAGGTGA 
GCTTGGATA6 
6C6TGAAGGC 
CGTCGCCGTG 
CAGGAACGAG 
GAATCTGGTG 
AAAGGACA6A 
TTTTCTTGCC 
TAAAGTAGAC 
ACCAGGCCAC 
TTTCCCAGAA 
TGAGGTCCAG 
ACAG6AA6AT 



ACCGTG6ACA 
GCTCTGCACA 
GTTAACGGTT 
TATGATCAGC 
TGCCTTCCTT 
TTGCATCGCA 
6CAAGGGGGA 
AACCAGCTGG 
TATTTGCATA 
GCAAAT6CGT 
ACATTATTCA 
CTAGGGAGAA 
GGGCCAATCT 
GGTAGGATCA 
CTTGGACAGC 
TGGTAGGATT 
TCCCAAAATA 
TTCAAGTACT 
ATTATGGGTA 
ATTAATATAG 
AAAAGTTTGG 
ATGGTTTGGA 
CTTAGACTCT 
ATT6ATTTGG 
GAGGAAAAAG 
GCTTTCAAGT 



AGAGCAGGT6 
ACCACTACAC 
ACCAACTACC 
CTCGACTGTG 
GACCCTGGAA 
TTGTCTGAGT 
GGATTGGGAA 
GGCTCGACAG 
ATGAGAAAAA 
T6CCAAAAAG 
GAGGGAGTAC 
ATATGCTTGT 
GCTCACACAG 
GTTGCTCCTC 
TCAGGGCTGC 
TTATCCCCGC 
TGGGGATTGG 
TCCAAAGAAT 
GGAAAACCT6 
TTCTCAGTA6 
ATGATGCCTT 
TAGTC6GAGG 
TTGTGACAA6 
GGAAATATAA 
6CATCAAGTA 
TCTCTGCTCC 



GCAGCAGGGG 3060 
GCAGAAGA6C 3120 
TAGACTGGAT 3180 
CCTTCTAGTT 3240 
GGTGCCACTC 3300 
AGGTGTCATT 3360 
6ACAATAGCA 3420 
CGCTGGATCT 3480 
AAGGAAAATT 3540 
GATGCTTTAG 3600 
CCAGAGCTGA 3660 
CATCACCGAA 3720 
GATAGAGAGG 3780 
ACATTTGCTT 3840 
GATTTCGCGC 3900 
TGCCATCATG 3960 
CAAGAACGGA 4020 
GACCACAACC 4080 
GTTCTCCATT 4140 
AGAACTCAAA 4200 
AA6ACTTATT 4260 
CAGTTCTGTT 4320 
GATCATGCAG 4380 
ACTTCTCCCA 4440 
TAAGTTTGAA 4500 
CCTCCTAAAG 4560 
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209 CTATGCATTT TTATAAGACC ATGGGACTTT TGCTGGCTTT AGATCAGCCT CGACTGTGCC 4620 
210 

211 TTCTAGTTGC CAGCCATCTG TTGTTTGCCC CTCCCCCGTG CCTTCCTTGA CCCT66AA6G 4680 
212 

213 TGCCACTCCC ACTGTCCTTT CCTAATAAAA TGA6GAAATT GCATCGCATT GTCTGAGTA6 4740 
214 

215 GTGTCATTCT ATTCTGGGGG GTGGGGTGGG GCAGGACAGC AAGGGGGAGG ATTGGGAAGA 4800 
216 

217 CAATAGCAGG CATGCTGGGG ATGCGGTGGG CTCTATGGAA CCAGCTGGGG CTCGAGCTAC 4860 
218 

219 TAGCTTTGCT TCTCAATTTC TTATTTGCAT AAT6AGAAAA AAAGGAAAAT TAATTTTAAC 4920 
220 

221 ACCAATTCAG TAGTTGATTG AGCAAATGCG TTGCCAAAAA GGATGCTTTA GAGACAGT6T 4980 
222 

223 TCTCTGCACA GATAAGGACA AACATTATTC AGAGGGAGTA CCCAGAGCTG AGACTCCTAA 5040 
224 

225 GCCAGTGAGT GGCACAGCAT TCTAGGGAGA AATATGCTTG TCATCACCGA AGCCTGATTC 5100 
226 

227 CGTA6AGCCA CACCTTGGTA AGGGCCAATC TGCTCACACA GGATAGAGAG GGCAGGAGCC 5160 
228 

229 AGGGCAGAGC ATATAA6GTG AGGTAGGATC AGTTGCTCCT CACATTTGCT TCTGACATAG 5220 
230 

231 TTGTGTTGGG AGCTTGGATC GATCCTCTAT GGTTGAACAA GATGGATTGC ACGCAGGTTC 5280 
232 

233 TCCGGCCGCT TG6GTGGAGA GGCTATTCGG CTATGACTGG GCACAACAGA CAATCGGCTG 5340 
234 

235 CTCTGATGCC GCCGTGTTCC GGCTGTCAGC GCAGGGGCGC CCGGTTCTTT TTGTCAAGAC 5400 
236 

237 CGACCTGTCC GGTGCCCTGA ATGAACTGCA GGACGAGGCA GCGCGGCTAT CGTGGCTGGC 5460 
238 

239 CACGACGGGC GTTCCTT6CG CAGCTGTGCT CGACGTTGTC ACTGAAGCGG GAAGGGACTG 5520 
240 

241 GCTGCTATTG GGCGAAGTGC CGGGGCAGGA TCTCCTGTCA TCTCACCTTG CTCCTGCCGA 5580 
242 

243 GAAAGTATCC ATCATGGCTG ATGCAATGCG GCGGCTGCAT ACGCTTGATC CGGCTACCTG 5640 
244 

245 CCCATTCGAC CACCAAGCGA AACATCGCAT CGAGCGAGCA CGTACTCGGA TGGAAGCC6G 5700 
246 

247 TCTTGTCGAT CAGGATGATC TGGACGAAGA GCATCAGGGG CTCGCGCCAG CCGAACTGTT 5760 
248 

249 CGCCAGGCTC AAGGCGCGCA TGCCCGACGG CGAGGATCTC GTCGTGACCC ATGGCGAT6C 5820 
250 

251 CTGCTTGCCG AATATCATGG TGGAAAATGG CCGCTTTTCT GGATTCATCG ACTGTGGCCG 5880 
252 

253 GCTGGGTGTG GCGGACCGCT ATCAGGACAT AGCGTTGGCT ACCCGTGATA TTGCTGAAGA 5940 
254 

255 GCTTGGCGGC GAATGGGCTG ACCGCTTCCT CGTGCTTTAC GGTATCGCCG CTCCCGATTC 6000 
256 

257 GCAGCGCATC GCCTTCTATC GCCTTCTTGA CGAGTTCTTC TGAGCGGGAC TCTGGGGTTC 6060 
258 

259 GAAATGACCG ACCAAGCGAC GCCCAACCTG CCATCACGAG ATTTCGATTC CACCGCCGCC 6120 
260 
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261 TTCTATGAAA GGTTGGGCTT CGGAATCGTT TTCCGGGACG CCGGCTGGAT GATCCTCCAG 6180 
262 

263 C6CGGGGATC TCATGCTGGA GTTCTTCGCC CACCCCAACT TGTTTATTGC AGCTTATAAT 6240 
264 

265 GGTTACAAAT AAAGCAATAG CATCACAAAT TTCACAAATA AAGCATTTTT TTCACTGCAT 6300 
266 

267 TCTAGTTGTG GTTTGTCCAA ACTCATCAAT CTATCTTATC ATGTCTGGAT CGCGGCCGCG 6360 
268 

269 ATCCCGTCGA GAGCTTGGCG TAATCATGGT CATAGCTGTT TCCTGTGTGA AATTGTTATC 6420 
270 

271 CGCTCACAAT TCCACACAAC ATACGAGCCG GAAGCATAAA GTGTAAAGCC TGGGGTGCCT 6480 
272 

273 AATGAGTGAG CTAACTCACA TTAATTGCGT TGCGCTCACT GCCCGCTTTC CAGTCGGGAA 6540 
274 

275 ACCTGTCGTG CCAGCTGCAT TAATGAATCG GCCAACGCGC GGGGAGA6GC GGTTTGCGTA 6600 
276 

277 TTGGGCGCTC TTCCGCTTCC TCGCTCACTG ACTCGCTGCG CTCGGTCGTT CGGCTGCGGC 6660 
278 

279 GAGCGGTATC AGCTCACTCA AAGGCGGTAA TACGGTTATC CACAGAATCA GGGGATAACG 6720 
280 

281 CAGGAAi^GAA CATGTGAGCA AAAGGCCAGC AAAAGGCCAG GAACCGTAAA AAGGCCGCGT 6780 
282 

283 TGCTGGCGTT TTTCCATAGG CTCCGCCCCC CTGACGAGCA TCACAAAAAT CGACGCTCAA 6840 
284 

285 GTCAGAGGTG GCGAAACCCG ACAGGACTAT AAAGATACCA GGCGTTTCCC CCTGGAAGCT 6900 
286 

287 CCCTCGTGCG CTCTCCTGTT CCGACCCTGC CGCTTACCGG ATACCTGTCC GCCTTTCTCC 6960 
288 

289 CTTCGGGAAG CGTGGCGCTT TCTCAATGCT CACGCTGTAG GTATCTCAGT TCGGTGTAGG 7020 
290 

291 TCGTTCGCTC CAAGCTGGGC TGTGTGCACG AACCCCCCGT TCAGCCCGAC CGCTGCGCCT 7080 
292 

293 TATCCGGTAA CTATCGTCTT GAGTCCAACC CGGTAAGACA CGACTTATCG CCACTGGCAG 7140 
294 

295 CAGCCACTGG TAACAGGATT AGCAGAGCGA GGTATGTAGG CGGTGCTACA GAGTTCTT6A 7200 
296 

297 AGTGGTGGCC TAACTACGGC TACACTAGAA GGACAGTATT TGGTATCTGC GCTCTGCTGA 7260 
298 

299 AGCCAGTTAC CTTCGGAAAA AGAGTTGGTA GCTCTTGATC CGGCAAACAA ACCACCGCTG 7320 
300 

301 GTAGCGGT6G TTTTTTTGTT T6CAAGCAGC AGATTACGCG CAGAAAAAAA GGATCTCAAG 7380 
302 

303 AAGATCCTTT GATCTTTTCT ACGGGGTCTG ACGCTCAGTG GAACGAAAAC TCACGTTAAG 7440 
304 

305 GGATTTTGGT CATGAGATTA TCAAAAA66A TCTTCACCTA GATCCTTTTA AATTAAAAAT 7500 
306 

307 GAAGTTTTAA ATCAATCTAA AGTATATATG AGTAAACTTG GTCTGACAGT TACCAATGCT 7560 
308 

309 TAATCAGTGA GGCACCTATC TCAGCGATCT GTCTATTTCG TTCATCCATA GTTGCCTGAC 7620 
310 

311 TCCCCGTCGT GTAGATAACT ACGATACGGG AGGGCTTACC ATCTGGCCCC AGTGCTGCAA 7680 
312 
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313 


TGATACCGCG 


AGACCCACGC 


TCACCGGCTC 


CAGATTTATC 


A6CAATAAAC 


CAGCCAGCCG 


7740 


314 


















315 


GAA66GCCGA 


GCGCAGAAGT 


GGTCCTGCAA 


CTTTATCCGC 


CTCCATCCAG 


TCTATTAATT 


7800 


316 


















317 


GTTGCCGG6A 


AGCTAGAGTA 


AGTAGTTCGC 


CA6TTAATAG 


TTTGCGCAAC 


GTTGTTGCCA 


7860 


318 


















319 


TTGCTACAGG 


CATCGTGGTG 


TCACGCTCGT 


CGTTTGGTAT 


GGCTTCATTC 


AGCTCCGGTT 


7920 


320 


















321 


CCCAACGATC 


AAGGCGAGTT 


ACATGATCCC 


CCATGTTGTG 


CAAAAAA6CG 


GTTAGCTCCT 


7980 


322 


















323 


TCGGTCCTCC 


GATCGTTGTC 


AGAAGTAAGT 


TGGCCGCAGT 


GTTATCACTC 


ATGGTTATGG 


8040 


324 


















325 


CAGCACT6CA 


TAATTCTCTT 


ACTGTCATGC 


CATCCGTAAG 


ATGCTTTTCT 


GTGACTGGTG 


8100 


326 


















327 


AGTACTCAAC 


CAAGTCATTC 


T6AGAATAGT 


GTATGCGGCG 


ACCGAGTTGC 


TCTTGCCCGG 


8160 


328 


















329 


CGTCAATACG 


GGATAATACC 


GCGCCACATA 


GCAGAACTTT 


AAAAGTGCTC 


ATCATTGGAA 


8220 


330 


















331 


AACGTTCTTC 


666GCGAAAA 


CTCTCAAGGA 


TCTTACCGCT 


6TTGAGATCC 


AGTTCGATGT 


8280 


332 


















333 


AACCCACTCG 


TGCACCCAAC 


TGATCTTCAG 


CATCTTTTAC 


TTTCACCAGC 


GTTTCTGGGT 


8340 


334 


















335 


GAGCAAAAAC 


AGGAAGGCAA 


AATGCCGCAA 


AAAAGGGAAT 


AAGGGC6ACA 


CGGAAATGTT 


8400 


336 


















337 


GAATACTCAT 


ACTCTTCCTT 


TTTCAATATT 


ATTGAAGCAT 


TTATCAGGGT 


TATTGTCTCA 


8460 


338 


















339 


TGAGCGGATA 


CATATTTGAA 


TGTATTTAGA 


AAAATAAACA 


AATAG66GTT 


CCGCGCACAT 


8520 


340 


















341 


TTCCCCGAAA 


AGTGCCACCT 










8540 


342 


















343 


















344 


(3) 


INFORMATION FOR 


SEQ ID NO: 


2: 








345 


















346 


(i) 


SEQUENCE CHARACTERISTICS: 










347 


















348 


(A) 


LENGTH: 9209 bases 










349 


(B) 


TYPE: 


: nucleic acid 










350 


(C) 


STRANDEDNESS : single 










351 


(D) 


TOPOLOGY : c 1 r cul ar 










352 


















353 


(ii) 


MOLECULE TYPE: 


DNA (genomic) 








354 


















355 


(iii) 


HYPOTHETICAL: no 










356 


















357 


(iv) 


ANTI' 


-SENSE: no 












358 


















359 


(ix) 


SEQUENCE DESCRIPTION: SEQ 


ID NO: 2: 








360 


















361 


















362 


GACGTCGCGG 


CCGCTCTAGG 


CCTCCAAAAA 


AGCCTCCTCA 


CTACTTCTGG 


AATAGCTCA6 


60 


363 


















364 


AGGCCGAG6C 


GGCCTCGGCC 


TCTGCATAAA 


TAAAAAAAAT 


TAGTCAGCCA 


TGCATGGGGC 


120 
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365 
366 
367 
368 
369 
370 
371 
372 
373 
374 
375 
376 
377 
378 
379 
380 
381 
382 
383 
384 
385 
386 
387 
388 
389 
390 
391 
392 
393 
394 
395 
396 
397 
398 
399 
400 
401 
402 
403 
404 
405 
406 
407 
408 
409 
410 
411 
412 
413 
414 
415 
416 



6GAGAAT6GG 

ATGGTTGCTG 

GACTTTCCAC 

GGGGAGCCTG 

AGTTATTAAT 

GTTACATAAC 

AC6TCAATAA 

TGGGT66ACT 

AGTACGCCCC 

ATGACCTTAT 

ATGGTGATGC 
TTTCCAAGTC 

GACTTTCCAA 

CGGTGGGAGG 

CATCACAGAT 

GCTTCAGTCA 

GCATCTCCAG 

CACTGGTTCC 

CTGGCTTCTG 

ACAATCAGCA 

AACCCACCCA 

TCTGTCTTCA 

TGCCTGCTGA 

CTCCAATCGG 

AGCCTCAGCA 

TGCGAAGTCA 



CGGAACTGGG 

ACTAATTGAG 

ACCTGGTTGC 

GGGACTTTCC 

AGTAATCAAT 

TTACG6TAAA 

TGACGTATGT 

ATTTACGGTA 

CTATTGACGT 

GGGACTTTCC 

GGTTTTGGCA 
TCCACCCCAT 

AATGTCGTAA 

TCTATATAAG 

CTCTCACTAT 

TAATGTCCAG 

GGGAGAAGGT 

AGCAGAAGCC 

GAGTCCCTGT 

GAGT6GAGGC 

CGTTCGGAGG 

TCTTCCCGCC 

ATAACTTCTA 

GTAACTCCCA 

GCACCCTGAC 

CCCATCAGGG 



CGGAGTTAGG 

ATGCATGCTT 

TGACTAATTG 

ACACCCTAAC 

TACGGGGTCA 

TGGCCCGCCT 

TCCCATAGTA 

AACTGCCCAC 

CAATGACGGT 

TACTT6GCA6 

GTACATCAAT 
TGACGTCAAT 

CAACTCCGCC 

CAGAGCTGGG 

GGATTTTCAG 

AGGACAAATT 

CACAATGACT 

AGGATCCTCC 

TCGCTTCAGT 

TGAAGATGCT 

GGGGACCAAG 

ATCTGATGAG 

TCCCAGAGAG 

GGAGAGT6TC 

GCTGAGCAAA 

CCTGAGCTCG 



GGC6GGATG6 

TGCATACTTC 

AGATGCATGC 

TGACACACAT 

TTAGTTCATA 

G6CTGACC6C 

ACGCCAATAG 

TTGGCAGTAC 

AAATG6CCC6 

TACATCTACG 

6G6CGTG6AT 
GGGAGTTTGT 

CCATTGACGC 

TACGTGAACC 

GTGCAGATTA 

GTTCTCTCCC 

TGCAGGGCCA 

CCCAAACCCT 

GGCAGTGGGT 

GCCACTTATT 

CTGGAAATCA 

CAGTTGAAAT 

GCCAAAGTAC 

ACAGAGCAGG 

6CAGACTACG 

CCCGTCACAA 



GCGGAGTTAG 

TGCCTGCTGG 

TTTGCATACT 

TCCACAGAAT 

GCCCATATAT 

CCAACGACCC 

GGACTTTCCA 

ATCAAGTGTA 

CCTGGCATTA 

TATTAGTCAT 

A6CGGTTTGA 
TTTGGCACCA 

AAATGG6CGG 

GTCAGATCGC 

TCAGCTTCCT 

AGTCTCCAGC 

GCTCAAGTGT 

GGATTTATGC 

CTGGGACTTC 

ACTGCCAGCA 

AACGTACGGT 

CTGGAACTGC 

AGTGGAAGGT 

ACA6CAAGGA 

A6AAACACAA 

AGAGCTTCAA 



GGGCGGGACT 

GGAGCCTGGG 

TCTGCCTGCT 

TAATTCCCCT 

GGAGTTCCGC 

CCGCCCATTG 

TTGACGTCAA 

TCATATGCCA 

TGCCCAGTAC 

CGCTATTACC 

CTCACGGGGA 
AAATCAACGG 

TAGGCGTGTA 

CTGGAGACGC 

GCTAATCAGT 

AATCCTGTCT 

AAGTTACATC 

CACATCCAAC 

TTACTCTCTC 

GT66ACTAGT 

GGCTGCACCA 

CTCTGTTGTG 

GGATAACGCC 

CAGCACCTAC 

AGTCTACGCC 

CAGGGGAGAG 



180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 
840 



900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
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417 

418 

419 

420 

421 

422 

423 

424 

425 

426 

427 

428 

429 

430 

431 

432 

433 

434 

435 

436 

437 

438 

439 

440 

441 

442 

443 

444 

445 

446 

447 

448 

449 

450 

451 

452 

453 

454 

455 

456 

457 

458 

459 

460 

461 

462 

463 

464 

465 

466 

467 

468 



TGTTGAATTC AGATCCGTTA ACGGTTACCA ACTACCTAGA CTGGATTC6T GACAACATGC 1740 

GGCCGTGATA TCTACGTATG ATCAGCCTCG ACTGTGCCTT CTAGTT6CCA GCCATCTGTT 1800 

GTTTGCCCCT CCCCCGTGCC TTCCTTGACC CTGGAAGGTG CCACTCCCAC TGTCCTTTCC 1860 

TAATAAAATG AGGAAATTGC ATCGCATTGT CTGAGTAGGT GTCATTCTAT TCTGGGGGGT 1920 

GGGGTGGGGC AGGACAGCAA GGGGGAGGAT TGGGAAGACA ATAGCAGGCA TGCTGGGGAT 1980 

GCGGTGGGCT CTATGGAACC AGCTGGGGCT CGACAGCTAT GCCAAGTACG CCCCCTATTG 2040 

ACGTCAATGA CGGTAAATGG CCCGCCTGGC ATTATGCCCA GTACATGACC TTATGGGACT 2100 

TTCCTACTTG GCAGTACATC TACGTATTAG TCATCGCTAT TACCATGGTG ATGCGGTTTT 2160 

GGCAGTACAT CAATGGGCGT GGATA6CGGT TT6ACTCACG GGGATTTCCA AGTCTCCACC 2220 

CCATTGACGT CAATGGGAGT TTGTTTTGGC ACCAAAATCA ACGGGACTTT CCAAAATGTC 2280 

GTAACAACTC CGCCCCATTG ACGCAAATGG GCGGTAGGCG TGTACGGTGG GAGGTCTATA 2340 

TAAGCAGAGC TGGGTACGTC CTCACATTCA GTGATCAGCA CTGAACACAG ACCCGTCGAC 2400 

ATGGGTTGGA GCCTCATCTT GCTCTTCCTT GTCGCTGTTG CTACGCGTGT CCTGTCCCAG 2460 

GTACAACTGC AGCAGCCTGG GGCTGAGCTG GTGAAGCCTG GGGCCTCAGT GAAGATGTCC 2520 

TGCAAGGCTT CTGGCTACAC ATTTACCAGT TACAATATGC ACTGGGTAAA ACAGACACCT 2580 

GGTCGGGGCC TGGAATGGAT TGGAGCTATT TATCCCGGAA ATGGTGATAC TTCCTACAAT 2640 

CAGAAGTTCA AAGGCAAGGC CACATTGACT GCAGACAAAT CCTCCA6CAC AGCCTACATG 2700 

CAGCTCAGCA GCCTGACATC TGAGGACTCT GCGGTCTATT ACTGTGCAAG ATCGACTTAC 2760 

TACGGCGGTG ACTGGTACTT CAATGTCTGG GGCGCAGGGA CCACGGTCAC CGTCTCTGCA 2820 

GCTAGCACCA AGGGCCCATC GGTCTTCCCC CTGGCACCCT CCTCCAAGAG CACCTCTGGG 2880 

GGCACAGCGG CCCTGGGCTG CCTGGTCAAG GACTACTTCC CCGAACCGGT GACGGTGTCG 2940 

TGGAACTCAG GCGCCCTGAC CAGCGGCGTG CACACCTTCC CGGCTGTCCT ACAGTCCTCA 3000 

GGACTCTACT CCCTCAGCAG CGTGGTGACC GTGCCCTCCA GCAGCTTGGG CACCCA6ACC 3060 

TACATCTGCA ACGTGAATCA CAAGCCCAGC AACACCAAGG TGGACAAGAA AGCAGAGCCC 3120 

AAATCTTGTG ACAAAACTCA CACATGCCCA CCGTGCCCAG CACCTGAACT CCTGGGGGGA 3180 

CCGTCAGTCT TCCTCTTCCC CCCAAAACCC AAGGACACCC TCATGATCTC CCGGACCCCT 3240 
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469 
470 
471 
472 
473 
474 
475 
476 
477 
478 
479 
480 
481 
482 
483 
484 
485 
486 
487 
488 
489 
490 
491 
492 
493 
494 
495 
496 
497 
498 
499 
500 
501 
502 
503 
504 
505 
506 
507 
508 
509 
510 
511 
512 
513 
514 
515 
516 
517 
518 
519 
520 



GA6GTCACAT 
TAC6T66AC6 
AGCACGTACC 
GAGTACAAGT 
AAA6CCAAA6 
CTGACCAAGA 
6CCGT6GAGT 
CTGGACTCCG 
CAGCAGGGGA 
CAGAAGAGCC 
AGACTGGATT 
CTTCTAGTTG 
GTGCCACTCC 
GGTGTCATTC 
ACAATA6CA6 
6CTGGATCTC 
AG6AAAATTA 
ATGCTTTAGA 
CA6A6CTGAG 
ATCACCGAAG 
ATAGA6A666 
CATTTGCTTC 
ATTTCGCGCC 
6CCATCATGG 
AAGAAC6GA6 
ACCACAACCT 



GCGTGGTG6T 
6C6TGGAG6T 
GTGTGGTCAG 
GCAAGGTCTC 
GGCAGCCCCG 
ACCAGGTCAG 
G66AGAGCAA 
ACGGCTCCTT 
ACGTCTTCTC 
TCTCCCTGTC 
C6TGACAACA 
CCAGCCATCT 
CACTGTCCTT 
TATTCTGGGG 
GCATGCTGGG 
CCGATCCCCA 
ATTTTAACAC 
GACAGTGTTC 
ACTCCTAAGC 
CCTGATTCCG 
CAG6A6CCAG 
TGACATAGTT 
AAACTTGAC6 
TTCGACCATT 
ACCTACCCTG 
CTTCAGTGGA 



G6ACGTGAGC 
GCATAATGCC 
CGTCCTCACC 
CAACAAAGCC 
A6AACCACA6 
CCTGACCTGC 
TGGGCAGCCG 
CTTCCTCTAC 
ATGCTCCGTG 
TCCG6GTAAA 
TGCGGCCGTG 
GTTGTTTGCC 
TCCTAATAAA 
GGTGGGGTGG 
GAT6CGGTGG 
GCTTTGCTTC 
CAATTCAGTA 
TCTGCACAGA 
CA6TGAGTGG 
TAGAGCCACA 
GGCAGAGCAT 
GTGTTGGGAG 
GCAATCCTAG 
GAACTGCATC 
GCCTCCGCTC 
AGGTAAACAG 



CAC6AAGACC 
AAGACAAAGC 
GTCCTGCACC 
CTCCCAGCCC 
GTGTACACCC 
CTGGTCAAAG 
GAGAACAACT 
AGCAA6CTCA 
ATGCATGAGG 
TGA66ATCCG 
ATATCTACGT 
CCTCCCCCGT 
ATGAGGAAAT 
6GCAGGACAG 
GCTCTATGGA 
TCAATTTCTT 
GTTGATTGA6 
TAAGGACAAA 
CACA6CATTC 
CCTTGGTAAG 
ATAA6GTGA6 
CTTGGATAGC 
CGTGAAGGCT 
GTCGCCGTGT 
AGGAACGAGT 
AATCT6GTGA 



CTGAGGTCAA 
CGCGGGA6GA 
AGGACTGGCT 
CCATCGAGAA 
TGCCCCCATC 
GCTTCTATCC 
ACAAGACCAC 
CCGTGGACAA 
CTCTGCACAA 
TTAACGGTTA 
ATGATCAGCC 
GCCTTCCTTG 
TGCATCGCAT 
CAAG66GGAG 
ACCAGCTGGG 
ATTTGCATAA 
CAAATGCGTT 
CATTATTCAG 
TAGGGAGAAA 
GGCCAATCTG 
GTAGGATCAG 
TTGGACAGCT 
6GTAGGATTT 
CCCAAAATAT 
TCAAGTACTT 
TTATGGGTAG 



GTTCAACT6G 
GCAGTACAAC 
GAATGGCAA6 
AACCATCTCC 
CCGGGATGAG 
CA6C6ACATC 
GCCTCCCGTG 
GAGCAGGT6G 
CCACTACACG 
CCAACTACCT 
TCGACTGT6C 
ACCCTGGAAG 
TGTCTGAGTA 
GATTG6GAAG 
GCTCGACA6C 
TGA6AAAAAA 
GCCAAAAA6G 
AGGGAGTACC 
TATGCTTGTC 
CTCACACAGG 
TTGCTCCTCA 
CAGGGCTGCG 
TATCCCCGCT 
GGGGATT6GC 
CCAAAGAATG 
GAAAACCTGG 



3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
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521 
522 
523 
524 
525 
526 
527 
528 
529 
530 
531 
532 
533 
534 
535 
536 
537 
538 
539 
540 
541 
542 
543 
544 
545 
546 
547 
548 
549 
550 
551 
552 
553 
554 
555 
556 
557 
558 
559 
560 
561 
562 
563 
564 
565 
566 
567 
568 
569 
570 
571 
572 



TTCTCCATTC 
GAACTCAAA6 
AGACTTATTG 
AGTTCT6TTT 
ATCATGCAGG 
CTTCTCCCAG 
AAGTTT6AAG 
CTCCTAAAGC 
GACTGTGCCT 
CCTGGAAGGT 
TCTGAGTAGG 
TTG6GAAGAC 
TCGAGCTACT 
AATTTTAACA 
AGACAGTGTT 
GACTCCTAA6 
GCCTGATTCC 
GCAGGAGCCA 
CTGACATAGT 
CGCAGGTTCT 
AATCGGCTGC 
TGTCAAGACC 
GTGGCTGGCC 
AAGGGACTGG 
TCCTGCCGAG 
GGCTACCTGC 



CTGAGAAGAA 
AACCACCACG 
AACAACC6GA 
ACCAGGAAGC 
AATTTGAAAG 
AATACCCAGG 
TCTACGAGAA 
TATGCATTTT 
TCTAGTTGCC 
GCCACTCCCA 
TGTCATTCTA 
AATA6CAGGC 
AGCTTTGCTT 
CCAATTCAGT 
CTCTGCACAG 
CCA6TGAGTG 
GTAGAGCCAC 
GGGCAGAGCA 
TGTGTTGGGA 
CCGGCCGCTT 
TCTGATGCCG 
GACCTGTCCG 
ACGAC6GGCG 
CTGCTATTGG 
AAAGTATCCA 
CCATTCGACC 



TCGACCTTTA 
AGGAGCTCAT 
ATTGGCAAGT 
CATGAATCAA 
TGACACGTTT 
CGTCCTCTCT 
GAAAGACTAA 
TATAA6ACCA 
AGCCATCTGT 
CTGTCCTTTC 
TTCTGGG6GG 
AT6CTG6GGA 
CTCAATTTCT 
AGTTGATTGA 
ATAA6GACAA 
GCACAGCATT 
ACCTTGGTAA 
TATAAGGTGA 
GCTTGGATCG 
GGGT66AGAG 
CCGTGTTCCG 
GTGCCCTGAA 
TTCCTTGCGC 
GCGAAGTGCC 
TCATGGCTGA 
ACCAAGCGAA 



AAG6ACAGAA 
TTTCTTGCCA 
AAAGTAGACA 
CCAGGCCACC 
TTCCCAGAAA 
GAGGTCCAGG 
CAGGAA6ATG 
TGGGACTTTT 
TGTTTGCCCC 
CTAATAAAAT 
TGGGGTGGGG 
TGCGGTGGGC 
TATTTGCATA 
GCAAATGCGT 
ACATTATTCA 
CTAGG6AGAA 
GGGCCAATCT 
GGTAG6ATCA 
ATCCTCTATG 
GCTATTCGGC 
GCTGTCAGCG 
TGAACTGCAG 
AGCTGTGCTC 
GGGGCAGGAT 
TGCAATGCGG 
ACATCGCATC 



TTAATATAGT 
AAAGTTTGGA 
TGGTTTGGAT 
TTAGACTCTT 
TTGATTTGGG 
AGGAAAAAGG 
CTTTCAAGTT 
GCTGGCTTTA 
TCCCCCGTGC 
GAGGAAATTG 
CAGGACAGCA 
TCTATGGAAC 
ATGAGAAAAA 
TGCCAAAAAG 
GAGGGAGTAC 
ATAT6CTTGT 
GCTCACACAG 
GTTGCTCCTC 
GTTGAACAAG 
TATGACTGGG 
CAGG6GCGCC 
GAC6AGGCAG 
GACGTTGTCA 
CTCCTGTCAT 
CGGCTGCATA 
GAGC6AGCAC 



TCTCAGTAGA 
TGATGCCTTA 
AGTCGGAGGC 
TGTGACAA6G 
GAAATATAAA 
CATCAAGTAT 
CTCTGCTCCC 
GATCAGCCTC 
CTTCCTTGAC 
CATCGCATTG 
AGGGGGAGGA 
CAGCTGGGGC 
AAGGAAAATT 
GATGCTTTAG 
CCA6AGCTGA 
CATCACCGAA 
GATAGAGAGG 
ACATTTGCTT 
ATGGATTGCA 
CACAACAGAC 
CGGTTCTTTT 
CGCGGCTATC 
CTGAA6CGGG 
CTCACCTTGC 
CGCTTGATCC 
GTACTCGGAT 



4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
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573 
574 
575 
576 
577 
578 
579 
580 
581 
582 
583 
584 
585 
586 
587 
588 
589 
590 
591 
592 
593 
594 
595 
596 
597 
598 
599 
600 
601 
602 
603 
604 
605 
606 
607 
608 
609 
610 
611 
612 
613 
614 
615 
616 
617 
618 
619 
620 
621 
622 
623 
624 



66AAGCCG6T 
CGAACTGTTC 
TGGCGATGCC 
CTGTGGCC6G 
TGCTGAAGAG 
TCCCGATTCG 
CTGGGGTTCG 
ACCGCCGCCT 
ATCCTCCAGC 
GCTTATAATG 
TCACTGCATT 
6C66CC6C6A 
ATTGTTATCC 
GGGGTGCCTA 
AGTCGGGAAA 
GTTTGCGTAT 
GGCTGCGGCG 
GGGATAACGC 
AGGCCGCGTT 
GAC6CTCAAG 
CTGGAA6CTC 
CCTTTCTCCC 
CGGTGTAGGT 
GCTGCGCCTT 
CACTGGCAGC 
AGTTCTTGAA 



CTTGTCGATC 
GCCAGGCTCA 
TGCTTGCCGA 
CT6GGTGTGG 
CTTGGCGGCG 
CAGCGCATCG 
AAATGACCGA 
TCTAT6AAAG 
GC6GGGATCT 
GTTACAAATA 
CTAGTTGTGG 
TCCCGTCGAG 
GCTCACAATT 
ATGAGTGAGC 
CCTGTCGTGC 
TGGGCGCTCT 
AGCGGTATCA 
A66AAAGAAC 
GCTGGCGTTT 
TCAGAGGTGG 
CCTCGTGC6C 
TTC6GGAA6C 
CGTTCGCTCC 
ATCCGGTAAC 
AGCCACTGGT 
GTGGTGGCCT 



AGGATGATCT 
AGGCGCGCAT 
ATATCATGGT 
CGGACCGCTA 
AATGGGCTGA 
CCTTCTATCG 
CCAAGCGACG 
GTTGGGCTTC 
CATGCTGGAG 
AAGCAATAGC 
TTTGTCCAAA 
AGCTTGGCGT 
CCACACAACA 
TAACTCACAT 
CAGCTGCATT 
TCCGCTTCCT 
GCTCACTCAA 
ATGTGAGCAA 
TTCCATAGGC 
CGAAACCCGA 
TCTCCTGTTC 
GTGGCGCTTT 
AAGCT66GCT 
TATCGTCTTG 
AACAGGATTA 
AACTACGGCT 



G6ACGAAGAG 
GCCCGACGGC 
GGAAAAT66C 
TCAGGACATA 
CCGCTTCCTC 
CCTTCTTGAC 
CCCAACCTGC 
GGAATCGTTT 
TTCTTCGCCC 
ATCACAAATT 
CTCATCAATC 
AATCATGGTC 
TACGAGCCGG 
TAATTGCGTT 
AATGAATCGG 
CGCTCACTGA 
AGGCGGTAAT 
AAGGCCAGCA 
TCCGCCCCCC 
CAGGACTATA 
CGACCCTGCC 
CTCAATGCTC 
6TGTGCACGA 
AGTCCAACCC 
GCAGAGCGAG 
ACACTAGAAG 



CATCAGGGGC 
GAGGATCTCG 
CGCTTTTCTG 
6CGTTGGCTA 
GTGCTTTACG 
GAGTTCTTCT 
CATCACGAGA 
TCCGGGAC6C 
ACCCCAACTT 
TCACAAATAA 
TATCTTATCA 
ATAGCTGTTT 
AAGCATAAAG 
GCGCTCACTG 
CCAACGCGCG 
CTCGCTGCGC 
ACGGTTATCC 
AAAGGCCAGG 
TGACGAGCAT 
AAGATACCAG 
GCTTACCGGA 
ACGCTGTA6G 
ACCCCCCGTT 
GGTAAGACAC 
GTATGTAGGC 
GACAGTATTT 



TCGCGCCAGC 
TCGTGACCCA 
GATTCATCGA 
CCC6TGATAT 
6TATCGCCGC 
GAGCG6GACT 
TTTCGATTCC 
C66CTGGAT6 
GTTTATTGCA 
AGCATTTTTT 
TGTCTGGATC 
CCTGTGTGAA 
TGTAAA6CCT 
CCCGCTTTCC 
GGGAGA66CG 
TCGGTCGTTC 
ACAGAATCAG 
AACC6TAAAA 
CACAAAAATC 
GCGTTTCCCC 
TACCTGTCCG 
TATCTCAGTT 
CAGCCCGACC 
GACTTATCGC 
GGTGCTACA6 
G6TATCTGCG 



6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 
6960 
7020 
7080 
7140 
7200 
7260 
7320 
7380 
7440 
7500 
7560 
7620 
7680 
7740 
7800 
7860 
7920 



m 
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625 
626 
627 
628 
629 
630 
631 
632 
633 
634 
635 
636 
637 
638 
639 
640 
641 
642 
643 
644 
645 
646 
647 
648 
649 
650 
651 
652 
653 
654 
655 
656 
657 
658 
659 
660 
661 
662 
663 
664 
665 
666 
667 
668 
669 
670 
671 
672 
673 
674 
675 
676 



CTCT6CTGAA 
CCACCGCTGG 
GATCTCAAGA 
CACGTTAAG6 
ATTAAAAATG 
ACCAATGCTT 
TTGCCTGACT 
GTGCTGCAAT 
AGCCAGCCGG 
CTATTAATTG 
TTGTTGCCAT 
GCTCCGGTTC 
TTAGCTCCTT 
TGGTTATGGC 
TGACTGGTGA 
CTT6CCCGGC 
TCATTGGAAA 
GTTCGATGTA 
TTTCTGGGTG 
GGAAATGTTG 
ATTGTCTCAT 
CGCGCACATT 



GCCAGTTACC 
TAGCGGTGGT 
AGATCCTTTG 
GATTTTGGTC 
AAGTTTTAAA 
AATCAGTGAG 
CCCCGTCGTG 
GATACCGCGA 
AAGGGCCGAG 
TTGCCGGGAA 
TGCTACAG6C 
CCAACGATCA 
CGGTCCTCCG 
AGCACTGCAT 
GTACTCAACC 
GTCAATAC6G 
ACGTTCTTCG 
ACCCACTCGT 
AGCAAAAACA 
AATACTCATA 
6AGC66ATAC 
TCCCCGAAAA 



TTCGGAAAAA 
TTTTTTGTTT 
ATCTTTTCTA 
ATGA6ATTAT 
TCAATCTAAA 
GCACCTATCT 
TAGATAACTA 
GACCCACGCT 
CGCAGAAGTG 
GCTAGAGTAA 
ATCGTGGTGT 
AGGCGAGTTA 
ATCGTTGTCA 
AATTCTCTTA 
AAGTCATTCT 
GATAATACCG 
GGGCGAAAAC 
GCACCCAACT 
GGAAG6CAAA 
CTCTTCCTTT 
ATATTTGAAT 
GTGCCACCT 



GAGTTGGTAG 
GCAAGCAGCA 
CGGGGTCTGA 
CAAAAAGGAT 
GTATATATGA 
CAGCGATCTG 
CGATACG6GA 
CACCGGCTCC 
GTCCTGCAAC 
GTAGTTCGCC 
CACGCTCGTC 
CATGATCCCC 
GAAGTAA6TT 
CTGTCATGCC 
GAGAATA6TG 
CGCCACATAG 
TCTCAAGGAT 
GATCTTCAGC 
ATGCCGCAAA 
TTCAATATTA 
GTATTTA6AA 



CTCTTGATCC 
GATTACGCGC 
CGCTCAGTGG 
CTTCACCTAG 
GTAAACTTGG 
TCTATTTCGT 
GGGCTTACCA 
AGATTTATCA 
TTTATCCGCC 
AGTTAATAGT 
GTTTGGTATG 
CATGTTGTGC 
6GCC6CAGTG 
ATCCGTAAGA 
TATGCGGC6A 
CAGAACTTTA 
CTTACCGCTG 
ATCTTTTACT 
AAAG6GAATA 
TTGAAGCATT 
AAATAAACAA 



GGCAAACAAA 
A6AAAAAAA6 
AACGAAAACT 
ATCCTTTTAA 
TCTGACAGTT 
TCATCCATAG 
TCTGGCCCCA 
GCAATAAACC 
TCCATCCAGT 
TTGCGCAACG 
GCTTCATTCA 
AAAAAAGCG6 
TTATCACTCA 
TGCTTTTCTG 
CCGAGTTGCT 
AAAGTGCTCA 
TTGAGATCCA 
TTCACCAGCG 
A6GGCGACAC 
TATCAGGGTT 
ATAGGGGTTC 



7980 
8040 
8100 
8160 
8220 
8280 
8340 
8400 
8460 
8520 
8580 
8640 
8700 
8760 
8820 
8880 
8940 
9000 
9060 
9120 
9180 
9209 



(4) INFORMATION FOR SEQ ID NO: 3: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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677 (ii) MOLECULE TYPE: DNA (genomic) 
678 

679 (iii) HYPOTHETICAL: no 
680 

681 (iv) ANTI-SENSE: no 
682 

683 (ix) SEQtTENCE DESCRIPTION: SEQ ID NO: 3: 

684 

685 ATC ACA GAT CTC TCA CCA TGG ATT TTC AGG TBC AGA TTA TCA GCT 45 

686 TC 47 
687 

688 (5) INFORMATION FOR SEQ ID NO: 4: 

689 (i) SEQUENCE CHARACTERISTICS: 

690 (A) LENGTH: 30 bases 

691 (B) TYPE: nucleic acid 

692 (C) STRANDEDNESS : single 

693 (D) TOPOLOGY: linear 
694 

695 (ii) MOLECULE TYPE: DNA (genomic) 
696 

697 (iii) HYPOTHETICAL: no 
698 

699 (iv) ANTI- SENSE: yes 
700 

701 (ix) SEQTTENCE DESCRIPTION: SEQ ID NO: 4: 

702 

703 TGC AGC ATC CGT ACG TTT GAT TTC CAG CTT 30 
704 

705 (6) INFORMATION FOR SEQ ID NO: 5: 

706 

707 (i) SEQUENCE CHARACTERISTICS: 

708 

709 (A) LENGTH: 384 bases 

710 (B) TYPE: nucleic acid 

711 (C) STRANDEDNESS: single 

712 (D) TOPOLOGY: linear 
713 

714 (ii) MOLECULE TYPE: DNA (genomic) 
715 

716 (iii) HYPOTHETICAL: no 
717 

718 (iv) ANTI-SENSE: no 
719 

720 (ix) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

721 

722 

723 ATG GAT TTT CAG GTG CAG ATT ATC AGC TTC CTG CTA ATC AGT GCT TCA GTC 51 
724 

725 ATA ATG TCC AGA GGG CAA ATT 6TT CTC TCC CAG TCT CCA GCA ATC CTG TCT 102 
726 

727 GCA TCT CCA GGG GAG AAG GTC ACA ATG ACT TGC AGG GCC AGC TCA AGT GTA 153 
728 



m 
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729 


ACT 


TAG ATC CAC TGG TTC GAG CAG AAG 


CCA 


730 










731 


ATT 


TAT GCC ACA TCC AAC CTG GCT TCT 


GGA 


732 










733 


ACT 


6GG TCT GGG ACT TCT TAC TCT CTC 


ACA 


734 










735 


GAT 


GCT GCC ACT TAT TAC TGC CAG CAG 


TGG 


736 










737 


GGA 


GGG GGG ACC AAG CTG GAA ATC AAA 




738 










739 










740 


(7) 




INFORMATION FOR SEQ ID NO; 6 




741 










742 


(i) 




SEQUENCE CHARACTERISTICS: 




743 










744 


(A) 




LENGTH: 27 bases 




745 


(B) 




TYPE: nucleic acid 




746 


(C) 




STRANDEDNESS : single 




747 


(D) 




TOPOLOGY: linear 




748 










749 


(ii) 




MOLECULE TYPE: DNA (genomic) 




750 










751 


(iii) 




HYPOTHETICAL: no 




752 










753 


(iv) 




ANTI- SENSE: no 




754 










755 


(ix) 




SEQXTENCE DESCRIPTION: SEQ ID 


NO: 


756 










757 


GCG GCT 


CCC ACG CGT GTC CTG TCC CAG 




758 










759 


(8) 




INFORMATION FOR SEQ ID NO: 7 




760 










761 


(i) 




SEQUENCE CHARACTERISTICS: 




762 










763 


(A) 




LENGTH: 29 bases 




764 


(B) 




TYPE: nucleic acid 




765 


(C) 




STRANDEDNESS : single 




766 


(D) 




TOPOLOGY : 1 ine ar 




767 










768 


(ii) 




MOLECULE TYPE: DNA (genomic) 




769 










770 


(iii) 




HYPOTHETICAL : no 




771 










772 


(iv) 




ANTI -SENSE: yes 




773 










774 


(ix) 




SEQUENCE DESCRIPTION: SEQ ID 


NO: 


775 










776 


GGS TGT 


TGT GCT AGC TGM RGA GAC RGT GA 


777 










778 


(9) 




INFORMATION FOR SEQ ID NO: 8 




779 










780 


(i) 




SEQUENCE CHARACTERISTICS: 





:C TCC CCC AAA CCC TGG 204 

:T GTT CGC TTC AGT GGC 255 

;C AGA 6TG GAG GCT GAA 306 

ST AAC CCA CCC ACG TTC 357 
384 



6: 
27 



7: 
29 
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781 

782 (A) LENGTH: 420 bases 

783 (B) TYPE: nucleic acid 

784 (C) STRAMDEDNESS : single 

785 (D) TOPOLOGY: linear 
786 

787 (ii) MOLECXTLE TYPE: DNA (genomic) 
788 

789 (iii) HYPOTHETICAL: no 
790 

791 (iv) ANTI-SENSE: no 
792 

793 (ix) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

794 

795 ATG GGT TGG AGC CTC ATC TTG CTC TTC CTT GTC GCT GTT GCT ACG CGT GTC 51 
796 

797 CTG TCC CAG GTA CAA CTG CAG CAG CCT GGG GCT GAG CTG GTG AAG CCT GGG 102 
798 

799 GCC TCA GTG AAG ATG TCC TGC AAG GCT TCT GGC TAC ACA TTT ACC AGT TAC 153 
800 

801 AAT ATG CAC TGG GTA AAA CAG ACA CCT GGT CGG GGC CTG GAA TGG ATT GGA 204 
802 

803 GCT ATT TAT CCC GGA AAT GGT GAT ACT TCC TAC AAT CAG AAG TTC AAA GGC 255 
804 

805 AAG GCC ACA TTG ACT GCA GAC AAA TCC TCC AGC ACA GCC TAC ATG CAG CTC 306 
806 

807 AGC AGC CTG ACA TCT GAG GAC TCT GCG GTC TAT TAC TGT GCA AGA TCG ACT 357 
808 

809 TAC TAC GGC GGT GAC TGG TAC TTC AAT GTC TGG GGC GCA GGG ACC ACG GTC 408 

810 ACC GTC TCT GCA 420 
811 



